Investigating Entropy for Extractive Document Summarization

Authors

Abstract

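Automatic text summarization aims to cut down readers’ time and cognitive effort by reducing the content of a text document without compromising on its essence. Ergo, informativeness is the prime attribute of a document summary generated by an algorithm, and selecting sentences that capture the essence of a document is the primary goal of extractive document summarization. In this paper, we employ Shannon’s entropy to capture the informativeness of sentences. We employ Non-negative Matrix Factorization (NMF) to reveal probability distributions for computing the entropy of terms, topics, and sentences in latent space. We present an information theoretic interpretation of the computed entropy, which is the bedrock of the proposed E-Summ algorithm, an unsupervised method for extractive document summarization. The algorithm systematically applies the information theoretic principle for selecting informative sentences from important topics in the document. The proposed algorithm is generic and fast, and hence amenable to use for summarization of documents in real time. Furthermore, it is domain- and collection-independent and agnostic to the language of the document. Benefiting from strictly positive NMF factor matrices, the E-Summ algorithm is transparent and explainable too. We use the standard ROUGE toolkit for performance evaluation of the proposed method on four well known public data-sets. We also perform a quantitative assessment of E-Summ summary quality by computing its semantic similarity w.r.t. the original document. Our investigation reveals that though using NMF and an information theoretic approach for document summarization promises efficient, explainable, and language independent text summarization, it needs to be bolstered to match the performance of deep neural methods.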
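
The idea in the abstract of reading probability distributions off NMF factors and measuring them with Shannon's entropy can be illustrated with a small sketch. This is not the E-Summ algorithm: the abstract does not give its formulas or its sentence-selection rule. The toy sentences, the whitespace tokenizer, the choice of two topics, the multiplicative-update NMF, and the row/column normalization used to obtain distributions are all assumptions made purely for illustration.

```python
import math
import random

EPS = 1e-9  # guards against division by zero in the updates


def matmul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def transpose(A):
    return [list(r) for r in zip(*A)]


def nmf(V, k, iters=300, seed=0):
    """Factor non-negative V (m x n) into W (m x k) and H (k x n)
    with Lee-Seung multiplicative updates (an assumed, generic NMF solver)."""
    rng = random.Random(seed)
    m, n = len(V), len(V[0])
    W = [[rng.random() + 0.1 for _ in range(k)] for _ in range(m)]
    H = [[rng.random() + 0.1 for _ in range(n)] for _ in range(k)]
    for _ in range(iters):
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + EPS) for j in range(n)]
             for i in range(k)]
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + EPS) for j in range(k)]
             for i in range(m)]
    return W, H


def normalize(values):
    """Turn non-negative weights into a probability distribution."""
    total = sum(values)
    if total <= 0:
        return [1.0 / len(values)] * len(values)
    return [v / total for v in values]


def shannon_entropy(p):
    """Shannon entropy in bits; zero-probability outcomes contribute nothing."""
    return -sum(x * math.log2(x) for x in p if x > 0)


# Hypothetical toy document, one sentence per entry.
sentences = [
    "entropy measures information",
    "nmf finds latent topics",
    "entropy of topics and sentences",
    "topics group related terms",
]
tokens = [s.split() for s in sentences]
vocab = sorted({t for sent in tokens for t in sent})

# Term-sentence count matrix: rows are terms, columns are sentences.
V = [[float(sent.count(term)) for sent in tokens] for term in vocab]

W, H = nmf(V, k=2)  # W: terms x topics, H: topics x sentences

# Assumed normalization: each sentence's column of H as a distribution over
# topics, and each topic's row of H as a distribution over sentences.
sentence_entropy = [shannon_entropy(normalize(col)) for col in transpose(H)]
topic_entropy = [shannon_entropy(normalize(row)) for row in H]

for s, h in zip(sentences, sentence_entropy):
    print(f"{h:.3f}  {s}")
```

Because the factors stay non-negative, every normalized row or column is a valid probability distribution, which is what makes the entropy values directly interpretable. How such values would be combined to rank topics and pick sentences is left open here, since the abstract does not specify it.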

Similar Resources

Extractive Document Summarization

We present two novel and contrasting Recurrent Neural Network (RNN) based architectures for extractive summarization of documents. The Classifier based architecture sequentially accepts or rejects each sentence in the original document order for its membership in the final summary. The Selector architecture, on the other hand, is free to pick one sentence at a time in any arbitrary order to pie...

Semi-extractive Multi-document Summarization

In this thesis, I design a Maximum Coverage problem with KnaPsack constraint (MCKP) based model for extractive multi-document summarization. The model integrates three measures to detect important sentences: Coverage, which rewards sentences according to how well they represent the whole document; Relevance, which favors sentences related to the given query; and Compression,...

Extractive spoken document summarization for information retrieval

The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling...

Extractive Multi-document Summarization Using Multilayer Networks

Huge volumes of textual information are produced every single day. In order to organize and understand such large datasets, summarization techniques have become popular in recent years. These techniques aim at finding relevant, concise and non-redundant content in such big data. While network methods have been adopted to model texts in some scenarios, a systematic evaluation of multi...

Classify or Select: Neural Architectures for Extractive Document Summarization

Journal

Journal title: Expert Systems With Applications

Year: 2022

ISSN: 1873-6793, 0957-4174

DOI: https://doi.org/10.1016/j.eswa.2021.115820